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- The MAILING DATE of this communication appears on the cover sheet with the correspondence address 
Period for Reply 

A SHORTENED STATUTORY PERIOD FOR REPLY IS SET TO EXPIRE 3 MONTH(S) FROM 
THE MAILING DATE OF THIS COMMUNICATION. 

- Extensions of time may be available under the provisions of 37 CFR 1 .136(a). In no event, however, may a reply be timely filed 
after SIX (6) MONTHS from the mailing date of this communication. 

• If the period for reply specified above is less than thirty (30) days, a reply within the statutory minimum of thirty (30) days will be considered timely. 

- tf NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 

- Failure to reply within the set or extended period for reply will, by statute, cause the application to become ABANDONED (35 U.S.C. § 133). 
Any reply received by the Office later than three months after the mailing date of this communication, even if timely filed, may reduce any 
earned patent term adjustment. See 37 CFR 1.704(b). 

Status 

1)S Responsive to communication(s) filed on 20 March 2001 . 
2a)n This action is FINAL. 2b)IE This action is non-final. 

3) n Since this application is in condition for allowance except for formal matters, prosecution as to the merits is 

closed in accordance with the practice under Ex parte Quayle, 1935 CD. 11, 453 O.G. 213. 

Disposition of Claims 

4) M Claim(s) 1-57 is/are pending in the application. 

4a) Of the above claim(s) is/are withdrawn from consideration. 

5) n Claim(s) is/are allowed. 

6) 13 Claim(s) 1-57 is/are rejected. 
?)□ Claim(s) is/are objected to. 

8) \3 Claim(s) are subject to restriction and/or election requirement. 

Application Papers 

9) n The specification is objected to by the Examiner. 

10) 13 The drawing(s) filed on 3/20/2001 is/are: a)l3 accepted or b)n objected to by the Examiner. 

Applicant may not request that any objection to the drawing(s) be held in abeyance. See 37 CFR 1 .85(a). 
Replacement drawing sheet(s) including the con-ection is required if the drawing(s) is objected to. See 37 CFR 1.121(d). 

1 1) 0 The oath or declaration is objected to by the Examiner. Note the attached Office Action or form PTO-152. 

Priority under 35 U.S.C. § 119 

12) 0 Acknowledgment is made of a claim for foreign priority under 35 U.S.C. § 119(a)-(d) or (f). 
a)n All b)n Some * 0)0 None of: 

1 .□ Certified copies of the priority documents have been received. 
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DETAILED ACTION 



Information Disclosure Statement 



1 . The information disclosure statement filed 6/27/2002 fails to comply with 37 CFR 
1 .98(a)(2), which requires a legible copy of each U.S. and foreign patent; each publication or that 
portion which caused it to be listed; and all other information or that portion which caused it to 
be listed. It has been placed in the appUcation file, but the information referred to therein has not 
been considered. 



2. The following is a quotation of the second paragraph of 35 U.S. C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

3. Claims 3 and 15-24 are rejected under 35 U.S.C. 1 12, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

Claim 3 recites the limitation "the intersection" in line 3. 

Claim 15 recites the limitation "the intersections" in line 10. 

There is insufficient antecedent basis for these limitations in the claims. 



Claim Rejections - 35 USC § 112 
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Claim Rejections - 35 USC §102 



4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 



(b) the invention was patented or described in a printed publication in this or a foreign country or in pubhc use or on 
sale in this country, more than one year prior to the date of application for patent in the United States. 



5. Claims 1-6, 14, 25-27, 29, and 30 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Erell et al (US. Patent: 5 J 48,489). 
With respect to Claim 1, Erell discloses: 

Generating at least two mixture components for a prior probability describing 
combinations of clean signal feature vectors with obscuring feature vectors (generating mixture 
models describing clean speech in the presence of noise, CoL 5, Lines 35-67)\ and 

Using each mixture component of the prior probability and the noisy signal feature vector 
to identify the clean signal feature vector (estimating a clean speech feature vector, Col. 5, Lines 
35-67), 

With respect to Claim 2, Erell recites: 

One obscuring feature vector is a noise feature vector (noisy speech feature vector, CoL 
5, Lines 35-67), 

With respect to Claim 3, Erell discloses: 

Generating a separate mixture component for the intersection of each noise feature vector 
with distributions of clean signal feature vectors (mixture components representing overlapping 
regions of acoustic space. Col 7, Line 36- Col 8, Line 22). 

With respect to Claim 4, Erell recites: 
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Generating a separate mixture component for each intersection of a distribution of noise 
feature vectors with a distribution of clean signal feature vectors (mixture components 
representing overlapping regions of acoustic space, Col 7, Line 36- Col 8, Line 22), 

With respect to Claim 5, Erell recites: 

The distribution of noise feature vectors is approximated as a Gaussian distribution (Col 
5, Lines 39-43), 

With respect to Claim 6, Erell discloses: 

Generating a separate mixture component for each intersection of a mixture component 
of the mixture of distributions for the noise feature vectors with a distribution of clean signal 
feature vectors (mixture components representing overlapping regions of acoustic space, each 
utilizing a noise probability distribution, Col 7, Line 36- Col 8, Line 22, and Col 5, Line 35- 
Col 6, Line 25). 

With respect to Claim 14, Erell discloses: 

Identifying the clean signal feature vector comprises using algorithms obtained through 
an approximate Bayesian inference technique to identify the clean feature vectors (Col 5, Lines 
60-67). 

With respect to Claim 25, Erell recites: 

Identifying a mixture of distributions that provide prior probabilities for combinations of 
clean signal feature vectors and obscuring feature vectors (generating mixture models describing 
clean speech in the presence of noise, Col 5, Lines 35-67); 

Determining an observation variance to associate with the noisy signal feature vector 
(variance associated with noise, Col 6, Line 25- Col 7, Line 35); and 
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Using the prior probability mixture of distributions and the observation variance to 
identify the clean signal feature vector (detecting speech in the presence of noise, Col 5, Line 
35' CoL 7, Line 35). 

With respect to Claim 26, Erell discloses: 

Determining an observation variance comprises using a closed- form expression to 
determine the variance (CoL 6, Lines 28-67). 
With respect to Claim 27, Erell recites: 

Generating an observed noisy signal feature vector from a noisy signal that is the 
combination of at least the clean signal and the noise signal (observed noise coefficients, Col 6, 
Lines l-67)\ 

Calculating a calculated noisy signal feature vector from the feature vector from the clean 
signal and the feature vector from the noise signal (vector describing speech in the presence of 
noise, Col 4, Line 61- Col 5, Line 17, that would require an inherent step of generating 
separate speech and noise vectors, and an expected noise coefficient. Col 6, Lines l-67)\ 

Comparing the observed noisy signal feature vector to the calculated noisy signal feature 
vector to determine the variance (variance calculation, Col 6, Lines 1-67). 

Claim 29 contains subject matter similar to Claim 1, and thus, is rejected for the same 
reasons. 

Claim 30 contains subject matter similar to Claim 2, and thus, is rejected for the same 
reasons. 
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6. Claims 41 and 43-48 are rejected under 35 U.S.C. 102(b) as being anticipated by 
Yamaguchi et al (U.S. Patent: 6,026,359). 

With respect to Claim 41, Yamaguchi discloses: 

Accessing a distribution of training feature vectors that represents a prior probability of 
combinations of training feature vectors (accessing initial noisy speech HMMs containing output 
probability distribution vectors. Col 11, Lines 23-56, and Col. 10, Lines 24-44); 

Setting an initial value for a component of a clean signal feature vector (initial clean 
speech HMMs, Col. 11, Lines 23-37); 

Determining a revised value for the component of the clean signal feature vector based in 
part on the initial value for the component, the distribution of training feature vectors, and the 
noisy signal feature vector (updating a noisy speech HMM comprising clean speech and noise 
HMMs, Col 11, Lines 23-56); 

Determining whether to accept the revised value as a final value for the component 
(judging whether an HMM update is necessary, Col 16, Lines 30-43); and 

Using the final value for the component to identify the clean signal feature vector (using 
the updated HMM to recognize clean speech in noise. Col 11, Lines 38-56). 

With respect to Claim 43, Yamaguchi further discloses: 

The steps of the method are repeated for a plurality of distributions of training feature 
vectors to generate a final value for a plurality of components of the clean signal feature vector 
(adapting a plurality of speech HMMs having multiple states, Col 11, Lines 38-56). 

With respect to Claim 44, Yamaguchi recites: 
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Determining a revised value comprises determining a revised value based in part on a 
variance associated with the noisy signal feature vector (updating a speech model based upon a 
difference between an initial and observed speech parameter, CoL 7, Lines 54- Col 8, Line 5). 

With respect to Claim 45, Yamaguchi discloses: 

Accessing a distribution comprises accessing a distribution representing a mixture 
component of a mixture of distributions for noise training feature vectors (output probability 
vector, Col. 10, Lines 40-54), 

With respect to Claim 46, Yamaguchi discloses: 

The mixture component is described in part by a variance (change in noise, Col 10, Line 
40' Col 11, Line 15). 

With respect to Claim 47, Yamaguchi teaches the step of accessing a mixture component 
as applied to Claim 45, and additionally discloses: 

Accessing a distribution representing a mixture component of a mixture of distributions 
for channel distortion training feature vectors (Col 14, Lines 16-23). 

Claim 48 contains subject matter similar to Claims 46 and 47, and thus, is rejected for the 
same reasons. 



Claim Rejections - 35 USC § 103 



1. 



The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 



obviousness rejections set forth in this Office action: 



(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
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having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made, 

8. Claims 7-11, 13, 18, 19, 23, 28, 31, and 37-39 are rejected under 35 U.S.C. 103(a) as 

being unpatentable over Erell et al in view of Eberman et al (U.S. Patent: 5,924,065). 

With respect to Claim 7, Erell teaches the method of estimating a clean speech feature 
vector from noise-degraded speech that utilizes mixture components, as applied to Claim 1. 
Erell does not specifically suggest the implementation of a similar method using channel 
distortion feature vectors in place of noise feature vectors, however, Eberman recites: 

One obscuring feature vector is a channel distortion feature vector (detecting a clean 
speech signal within a signal distorted by channel distortion, Col 4, Line 58-Col 6, Line 34). 

Erell and Eberman are analogous art because they are from a similar field of endeavor in 
detecting speech in noise. Thus, it would have been obvious to a person of ordinary skill in the 
art, at the time of invention, to modify the teaching of Erell with the use of a channel distortion 
vector in detecting a clean speech signal vector as taught by Eberman to improve the means of 
identifying and representing a clean speech signal by further considering channel distortion- a 
common and well-known speech degrading effect (Eberman, Col. 1, Lines 32-49, and Col 3, 
Lines 61-67). 

With respect to Claim 8, Erell teaches the mixture generation method as applied to Claim 

3, while Eberman teaches the further use of a channel distortion vector in estimating a clean 
speech vector as applied to Claim 7. 

With respect to Claim 9, Erell teaches the mixture generation method as applied to Claim 

4, while Eberman teaches the further use of a channel distortion vector in estimating a clean 
speech vector as appUed to Claim 7. 
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Claim 10 contains subject matter similar to Claim 5, and thus, is rejected for the same 
reasons. 

With respect to Claim 11, Erell teaches the mixture generation method as appUed to 
Claim 6, while Eberman teaches the further use of a channel distortion vector in estimating a 
clean speech vector as applied to Claim 7. 

With respect to Claim 13, Erell teaches the method of estimating a clean speech feature 
vector from noise-degraded speech that utilizes mixture components, as applied to Claim 1, 
while Eberman teaches the further use of a channel distortion vector in estimating a clean speech 
vector as applied to Claim 7. Although neither Erell nor Eberman specifically suggest utilizing 
multiple channel distortion vectors corresponding to different channels, the examiner takes 
official notice that it is well known in the art to do so in order to detect a clean speech signal in a 
variety of channel distorted environments. Therefore, in order to implement improved clean 
speech detection in a variety of channel distortion environments, it would have been obvious to 
modify the teachings of Erell and Eberman with the use of multiple channel distortion vectors 
corresponding to different channels. 

Claim 18 contains subject matter similar to Claim 7, and thus, is rejected for the same 
reasons. 

Claim 19 contains subject matter similar to Claim 13, and thus, is rejected for the same 
reasons. 

Claim 23 contains subject matter similar to Claims 7 and 22, and thus, is rejected for the 
same reasons. 
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Claim 28 contains subject matter similar to Claims 7 and 27, and thus, is rejected for the 
same reasons. 

Claim 31 contains subject matter similar to Claim 7, and thus, is rejected for the same 
reasons. 

Claim 37 contains subject matter similar to Claim 7, and thus, is rejected for the same 
reasons. 

Claim 38 contains subject matter similar to Claims 2 and 7, and thus, is rejected for the 
same reasons. 

Claim 39 contains subject matter similar to Claim 13, and thus, is rejected for the same 
reasons. 

9. Claims 12, 15-17, 20-22, 24, 32-36, and 40 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Erell et al. 

With respect to Claim 12, Erell teaches the method of estimating a clean speech feature 
vector from noise-degraded speech that utilizes mixture components, as applied to Claim 1. 
Erell does not specifically suggest that clean speech feature vectors are from at least two sources, 
however, the examiner takes official notice that it is well known in the art to utihze feature 
vectors from muhiple sources in order to implement speaker dependent speech recognition. 
Therefore, in order to improve recognition accuracy for individual speakers through speaker 
dependent speech recognition, it would have been obvious to one of ordinary skill in the art, at 
the time of invention, to modify the teachings of Erell with the use of clean speech feature 
vectors from multiple sources. 
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With respect to Claim 15, Erell recites: 

A computer-readable medium comprising computer-executable instructions for 
performing steps comprising: 

Determining the intersections of at least two distributions of obscuring feature vectors 
with at least one distribution of model clean signal feature vectors (mixture components 
representing overlapping regions of acoustic space for multiple frequency channels, Col. 7, Line 
36- Col <§, Line 22); 

Using the intersections to identify at least two mixture components for a probability 
distribution that describes the prior probability of combinations of obscuring feature vectors and 
clean signal feature vectors (mixture components representing overlapping regions of acoustic 
space, Col 7, Line 36- Col 8, Line 22)\ and 

Using the mixture components of the prior probability and the feature vector for the noisy 
signal to identify the feature vector for the clean signal (estimating a clean speech feature vector. 
Col 5, Lines 35-67), 

Erell does not specifically suggest method storage on a computer readable medium, 
however, it would have been obvious to one of ordinary skill in the art, at the time of invention, 
to store the clean speech estimation method taught by Erell on a computer readable medium to 
increase method compatibility and usability by providing a means for method use with multiple 
computer systems. 

Claim 16 contains subject matter similar to Claims 3 and 15, and thus, is rejected for the 
same reasons. 
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Claim 17 contains subject matter similar to Claim 2, and thus, is rejected for the same 
reasons. 

Claim 20 contains subject matter similar to Claim 6, and thus, is rejected for the same 
reasons. 

Claim 21 contains subject matter similar to Claim 5, and thus, is rejected for the same 
reasons. 

With respect to Claim 22, Erell discloses: 

At least two distributions of obscuring feature vectors comprise at least two distributions 
of noise feature vectors (noise distributions for multiple frequency channels, Col 6, Lines 1-24). 

Claim 24 contains subject matter similar to Claim 12, and thus, is rejected for the same 
reasons. 

With respect to Claim 32, Erell recites: 

Accessing a noisy signal feature vector (Col. 4, Line 61- Col 5, Line 17)\ 

Accessing at least one distribution of training feature vectors (accessing a filter banking 

containing noisy speech vectors, Col 4, Line 61- Col 5, Line 17)\ 

Determining a variance for the noisy signal feature vector (variance calculation. Col 6, 

Lines 1-67); 

Identifying a clean signal feature vector from the noisy signal feature vector, the 
distribution of training feature vectors and the variance for the noisy signal feature vector 
(detecting speech in the presence of noise, Col 5, Line 35- Col 7, Line 35). 

Erell does not specifically suggest method storage on a computer readable medium, 
however, it would have been obvious to one of ordinary skill in the art, at the time of invention, 
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to store the clean speech estimation method taught by Erell on a computer readable medium to 
increase method compatibility and usability by providing a means for method use with multiple 
computer systems. 

Claim 33 contains subject matter similar to Claim 26, and thus, is rejected for the same 
reasons. 

With respect to Claim 34, Erell discloses: 

Applying feature vectors to a function to produce a calculated noisy signal feature vector 
(DFTof noise variables, Col 6, Lines 28'67)\ 

Accessing an observed noisy signal feature vector (col 6, Lines 28-67); and 

Comparing the calculated noisy signal feature vector to the observed noisy signal feature 
vector to produce the variance for the noisy signal feature vector (variance calculation, Col 6, 
Lines 1-67), 

With respect to Claim 35, Erell recites: 

The calculated noisy signal feature vector comprises at least one clean signal feature 
vector and at least one obscuring signal feature vector (noisy speech vectors comprising noise 
and speech components, Col 4, Line 61- Col 5, Line 17, and Col 6, Lines 49-67), 

Claim 36 contains subject matter similar to Claim 2, and thus, is rejected for the same 
reasons. 

Claim 40 contains subject matter similar to Claim 12, and thus, is rejected for the same 
reasons. 
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10. Claims 42 and 49-57 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Yamaguchi et al in view of Ramaswamy et al (U.S. Patent: 6 J 88,976). 

With respect to Claim 42, Yamaguchi teaches the clean speech model updating method 
as applied to Claim 41 . Yamaguchi does not specifically suggest deciding not to update 
according to an initial model and then providing a further model update, however, the addition of 
such steps would have been obvious to one of ordinary skill in the art as is evidenced by 
Ramaswamy (implementing an additional model update if an initial model is unacceptable, Col 
3, Line 64- Col. 4, Line 7). 

Yamaguchi and Ramaswamy are analogous art because they are from a similar field of 
endeavor in speech model adaptation. Thus, it would have been obvious to a person of ordinary 
skill in the art, at the time of invention, to modify the teachings of Yamaguchi with the further 
steps of implementing an additional model update if a first update is considered to be 
unacceptable as taught by Ramaswamy to ensure that a clean speech vector is acceptable for use 
in a speech recognition process. 

With respect to Claim 49, Yamaguchi discloses: 

Accessing a noisy signal feature vector (initial noise HMM, Col. 11, Lines 23-56, and 
Col 10, Lines 24-44); 

Accessing at least one distribution of training feature vectors (initial clean speech HMM, 
Col 11, Lines 23-56, and Col 10, Lines 24-44); 

Identifying an initial value for a clean signal feature vector (initial clean speech HMM, 
Col 11, Lines 23-56, and Col. 10, Lines 24-44); and 
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Although Yamaguchi teaches updating a speech HMM as applied to Claim 41 , 
Yamaguchi does not teach an iterative process of further updating an altered speech vector, 
however, Ramaswamy teaches such a function in implementing an additional model update if a 
model is determined to be unacceptable as applied to Claim 42. 

Thus, it would have been obvious to a person of ordinary skill in the art, at the time of 
invention, to modify the teachings of Yamaguchi with the further steps of implementing an 
additional model update if a first update is considered to be unacceptable as taught by 
Ramaswamy to ensure that a clean speech vector is acceptable for use in a speech recognition 
process. 

Claim 50 contains subject matter similar to Claim 44, and thus, is rejected for the same 
reasons. 

Claims 51 and 53 contain subject matter similar to Claim 45, and thus, are rejected for 
the same reasons. 

Claim 52 contains subject matter similar to Claim 46, and thus, is rejected for the same 
reasons. 

Claims 54 and 56 contain subject matter similar to Claim 47, and thus, are rejected for 
the same reasons. 

Claim 55 contains subject matter similar to Claim 48, and thus, is rejected for the same 
reasons. 

With respect to Claim 57, Yamaguchi in view of Ramaswamy teaches the clean speech 
model updating method as apphed to Claim 49. Although neither Yamaguchi nor Ramaswamy 
specifically suggest utilizing multiple channel distortion vectors corresponding to different 
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channels, the examiner takes official notice that it is well known in the art to do so in order to 
detect a clean speech signal in a variety of channel distorted environments. Therefore, in order 
to implement improved clean speech detection in a variety of channel distortion environments, it 
would have been obvious to modify the teachings of Yamaguchi and Ramaswamy with the use 
of multiple channel distortion vectors corresponding to different channels. 

Conclusion 

1 1 . The prior art made of record and not relied upon is considered pertinent to applicant*s 
disclosure. 

• Bahl et al (U.S, Patent: 6,067,517)- teaches a method of feature vector 
classification. 

• Ephraim et al (U.S. Patent: 6,202,047)- discloses a method for estimating a clean 
speech feature vector from a noisy speech vector that utilizes variance. 

• Gong (U.S, Patent: 6,633,842)- teaches a method for detecting a clean speech 
vector in a noisy environment. 

12. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to James S. Wozniak whose telephone number is (703) 305-8669 
and email is James.Wozniak@uspto.gov. The examiner can normally be reached on Mondays- 
Fridays, 8:30-4:30. 
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If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Doris To can be reached at (703) 305-4827. The fax/phone number for the 
Technology Center 2600 where this application is assigned is (703) 872-9306. 

Any inquiry of a general nature or relating to the status of this application or proceeding 
should be directed to the technology center receptionist whose telephone number is (703) 306- 
0377. 

James S. Wozniak 
9/15/2004 




SUSAN MCFADDEN 
PRIMARY EXAMINER 



